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In the Claims: 

1 . (original) A method of determining parameter combinations for automated access to World 
Wide Web content that is accessible based on parameters resulting from real user interactions 
with a World Wide Web site, said method comprising: 

maintaining at least one log file containing at least one set of parameters resulting 
from real user interactions with said World Wide Web site; 
analyzing said log file to determine parameter combinations for automated access to said World 
Wide Web content. 

2. (original) A method of determining parameter combinations for automated access to World 
Wide Web content that is accessible based on parameters resulting from real user interactions 
with a World Wide Web site, as per claim 1, wherein said parameters are entries in HTML 
forms, said analyzing step further comprising 



ranked below a predetermined number; and 
wherein said parameter combinations are determined by producing combinations of entries from 
each set of entries. 

3. (original) A method of determining parameter combinations for automated access to World 
Wide Web content that is accessible based on parameters resulting from real user interactions 
with a World Wide Web site, as per claim 2, wherein said parameter combinations are 
determined by producing all combinations of entries from each set of entries. 




ranking entries in each set of entries according to their frequency of occurrence; 



for each set of entries resulting from unlimited text entries, excluding entries 



Page 4 of 17 




10/042,367 

4. (original) A method of determining parameter combinations for automated access to World 
Wide Web content that is accessible based on parameters resulting from real user interactions 
with a World Wide Web site, as per claim 2, wherein entries resulting from limited text entries 
and unlimited text entries have stop words removed and remaining words stemmed. 



5. (original) A method of determining parameter combinations for automated access to World 
Wide Web content that is accessible based on parameters resulting from real user interactions 
with a World Wide Web site, as per claim 1, wherein said log file is maintained by a proxy server 
that logs communications between a client and a Web server resulting from real user accesses to 
said World Wide Web content. 



6. (original) A method of determining parameter combinations for automated access to World 
Wide Web content that is accessible based on parameters resulting from real user interactions 
with a World Wide Web site, as per claim 1 , wherein said content is automatically accessed 
using said parameter combinations. 

7. (currently amended) A method of increasing web crawler penetration of Web databases 
accessible via HTML forms, said method comprising: 

reviewing previous real user form input data q ueries; 

identifying possible form input data q ueries for said Web crawler from said 
previous real user form input data q ueries-by synthesis of entries for any of: predefined 
sets, limited text entries or unlimited text entries; and 
providing said identified form input data q ueries to said Web crawler during an instantiation of 

automated access to said Web databases by said Web crawler. 
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8. (currently amended) A method of increasing web crawler penetration of Web databases 
accessible via HTML forms, as per claim 7, wherein said previous form input data q ueries are 
maintained in a log file. 

9. (original) A method of increasing web crawler penetration of Web databases accessible via 
HTML forms, as per claim 8, wherein said log file is maintained by a proxy server. 



10. (original) A method of increasing web crawler penetration of Web databases accessible via 
HTML forms, as per claim 7, wherein said synthesis comprises: 

ranking any entries for predetermined sets; 

ranking any entries for limited text entries; 

ranking any entries for unlimited text entries; 

excluding entries for unlimited text entries ranked below a predetermined number; 

and 

pairing entries from each set of ranked entries. 



11. (original) A method of increasing web crawler penetration of Web databases accessible via 
HTML forms, as per claim 10, wherein said synthesis further comprises: 

removing stop words and stemming remaining words for entries resulting from 
limited text entries and unlimited text entries. 

12. (original) A method of determining entries for input items of an HTML form for automated 
accesses to content contained in a Web database behind said HTML form, said method 
comprising: 
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maintaining a log of real user entries for said input items; 



analyzing said log to determine entry combinations for said input items. 



13. (original) A method of determining entries for input items of an HTML form for automated 
accesses to content contained in a Web database behind said HTML form, as per claim 12, 
wherein said log file contains at least one set of entries, said analyzing step further comprising 

ranking entries in each set of entries according to their frequency of occurrence; 

for each set of entries resulting from unlimited text entries, excluding entries 
ranked below a predetermined number; and 

wherein said automated parameter combinations are determined by producing 
combinations of entries from each set of entries. 

14. (original) A method of determining entries for input items of an HTML form for automated 
accesses to content contained in a Web database behind said HTML form, as per claim 13, 
wherein said parameter combinations are determined by producing all combinations of entries 
from each set of entries. 

15. (original) A method of determining entries for input items of an HTML form for automated 
accesses to content contained in a Web database behind said HTML form, as per claim 13, 
wherein entries resulting from limited text entries and unlimited text entries have stop words 
removed and remaining words stemmed. 

16. (original) A method of determining entries for input items of an HTML form for automated 

accesses to content contained in a Web database behind said HTML form, as per claim 12, 
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wherein said log file is maintained by a proxy server that logs communications between a client 
and a Web server resulting from real user accesses to said World Wide Web content. 

17. (original) A method of emulating real user access to World Wide Web content dynamically 
accessible via an HTML form, said method comprising: 

maintaining a log containing real user entries into each input item of said HTML 

form; 

ranking entries for each input item according to their frequency of occurrence; 
for each unlimited text entry input item, excluding entries ranked below a 
predetermined number; 

determining combinations of entries from each set of entries; and 
automatically accessing said content using said combinations of entries. 

18. (original) A method of emulating real user access to World Wide Web content dynamically 
accessible via an HTML form, as per claim 17, wherein entries resulting from limited text entries 
and unlimited text entries have stop words removed and remaining words stemmed. 

19. (original) A method of emulating real user access to World Wide Web content dynamically 
accessible via an HTML form, as per claim 17, wherein said log file is maintained by a proxy 
server that logs communications between a client and a Web server resulting from real user 
accesses to said World Wide Web content. 

20. (original) An article of manufacture comprising a computer usable medium having 
computer readable program code embed therein to determine parameter combinations for 
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automated access to World Wide Web content that is accessible based on parameters resulting 
from user interactions with a World Wide Web site, said computer readable program code 
comprising: 

computer readable program code for maintaining at least one log file 
representative of real user interactions with said World Wide Web site; 

computer readable program code for analyzing said log file to determine 
parameter combinations for automated access to said World Wide Web content. 

21. (original) An article of manufacture comprising a computer usable medium having 
computer readable program code embed therein to determine parameter combinations for 
automated access to World Wide Web content that is accessible based on parameters resulting 
from user interactions with a World Wide Web site, as per claim 20, wherein said parameters are 
entries in HTML forms, said computer readable program code for analyzing further comprising 

computer readable program code for ranking entries in each set of entries 
according to their frequency of occurrence; and 

computer readable program code for each set of entries resulting from unlimited 
text entries, excluding entries ranked below a predetermined number; and 

wherein said parameter combinations are determined by producing combinations 
of entries from each set of entries. 

22. (original) An article of manufacture comprising a computer usable medium having 
computer readable program code embed therein to determine parameter combinations for 
automated access to World Wide Web content that is accessible based on parameters resulting 
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from user interactions with a World Wide Web site, as per claim 21 , wherein said parameter 
combinations are determined by producing all combinations of entries from each set of entries. 



23. (original) An article of manufacture comprising a computer usable medium having 
computer readable program code embed therein to determine parameter combinations for 
automated access to World Wide Web content that is accessible based on parameters resulting 
from user interactions with a World Wide Web site, as per claim 21, wherein entries resulting 
from limited text entries and unlimited text entries have stop words removed and remaining 
words stemmed. 



24. (original) An article of manufacture comprising a computer usable medium having 
computer readable program code embed therein to determine parameter combinations for 
automated access to World Wide Web content that is accessible based on parameters resulting 
from user interactions with a World Wide Web site, as per claim 20, wherein said log file is 
maintained by a proxy server that logs communications between a client and a Web server 
resulting from real user accesses to said World Wide Web content. 

25. (original) An article of manufacture comprising a computer usable medium having 
computer readable program code embed therein to determine parameter combinations for 
automated access to World Wide Web content that is accessible based on parameters resulting 
from user interactions with a World Wide Web site, as per claim 20, wherein said content is 
automatically access using said parameter combinations. 
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